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Abstract: Skin cancer was among the most lethal illnesses in the globe, 
especially malignant melanoma, benign seems to be a kind of skin cancer 
that would be the most deadly because of its rapid development, affecting a 
huge number of individuals globally. Early diagnosis of skin cancer seems to 
be critical, hence our research employs the YOLO v3 - DCNN architecture 
to discover and categorize the deadliest kinds of skin cancer. Initially, YOLO 
v3 generates the feature map, simultaneously color features are extracted 
using color moments with QuadHistogram, whereas Grey Level Co- 
occurrence Matrix (GLCM) with Redundant Contourlet Transform(RCT) is 
generated texture features, and both (color and texture) features are get 
fused. Then, fused features are fed into the Deep Convolutional Neural 
Network (DCNN) which classifies the different types of skin cancer. Finally, 
our proposed approach is compared with the existing works. As a 
consequence, when contrasted to the baseline techniques, our proposed 
YOLO-v3 -DCNN has a greater accuracy. 


Keywords: Skin cancer, Deep Convolutional Neural Network (DCNN), Deep 
Learning, You Only Look Once (YOLO-Vv3). 


INTRODUCTION 


The skin seems to be the body's largest organ, and it shields it from 
temperature, radiation, as well as illness. This also aids in body temperature 
control including fat as well as water storage. Among the most drastic 
concerns about the skin in the body has been the threat of skin cancer 
infection [1]. Skin cancer is really an infection generated by irregular cells 


which affects humans. Skin cancer starts with cells, which have been the 
fundamental components of the skin; skin cells proliferate as well as split to 
produce new cells. Skin cells age and die on a daily basis, and new ones 
replace them. This meticulous approach might lead to errors from time to 
time. Whenever the skin does not need the them, fresh cells develop, while 
old cells perish when they don't. Excessive cells clump together to form a 
mass of tissue termed as a tumor [2, 3]. If this is not diagnosed early, it may 
have the ability to spread to certain other areas of the body [4]. 


Skin cancer has been typically categorized as melanoma and nonmelanoma 
[5]. Melanoma seems to be a severe, uncommon, as well as lethal kind of 
skin cancer. As per the American Cancer Society, melanoma skin cancer 
accounts for just 1% among all occurrences, but it does have a higher 
mortality rate [6]. Melanoma grows from melanocytes, which are skin cells. 
It begins whenever healthy melanocytes multiply excessively, results in 
formation of a malignant tumour. This will have the potential to adversely 
affect any part of the human body. It is especially common on sun-exposed 
areas such as the hands, face, neck, as well as lips. Melanocytes are already 
responsible for the production of dark colours on the skin, hair, eyes, and 
other body parts. As a consequence, most melanoma tumours appeared 
brown or black in colour. Nonetheless, in a small percentage of instances, 
melanomas do not establish colour and instead look pink, red, or purple [7]. 


Melanoma cancers should only be cured if they have been identified early; 
or else, they spread to the rest of the body eventually kill the sufferer [8]. 
Nonmelanoma tumours have become less complicated to cure than 
melanoma tumours. Melanoma research [9] was among the first to 
emphasis on Computer-Aided Diagnosis (CAD) [10, 11] via applying 
automated strategies for the earlier identification of all skin lesions. Such 
systems utilize classical computer vision technology to extract attributes 
like shape, colour, as well as texture to supply a classification model [12- 
16]. 


Early identification of melanoma has been seen in studies to considerably 
lower the mortality percentage from melanoma cancer [17]. The fact that 
earlier identification of melanoma remains tough, even for specialists, is 
indeed a major issue. As a result, utilizing a strategy to simplify the 
diagnosis might be beneficial to professionals [18-21]. Because of the wide 
variety of skin lesions seen in dermatology, an automated skin cancer 
diagnosis is a difficult undertaking. Various automated approaches for 
detecting and diagnosing skin cancer have been developed throughout the 
years. 


The traditional way of medical picture analysis involves a succession of low- 
level pixel processing technologies. Because of the poor distinction between 
both the surrounding skin as well as the lesion region, accurate 
segmentation of the lesion is also difficult. . Recent advancements in this 
area shown that deep learning has been the most beneficial machine 
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learning approach applied towards the issue and utilized in a variety of 
applications including speech recognition [22], pattern recognition [23], as 
well as bioinformatics [24]. Deep learning methodologies had produced 
amazing outcomes in several areas when compared to other traditional 
approaches to machine learning. Numerous deep learning methodologies 
also being investigated in latest years for computer-dependent skin cancer 
screening. Using these strategies speeds up the diagnostic procedure but 
also lessens human errors. This can affect the effectiveness and quickness 
with which physicians as well as radiologists identify melanoma. Even 
though the CNN design produces superior results, there are significant 
accuracy limitations in extracting features from skin lesions, necessitating 
the development of a new deep learning-dependent architecture. As a 
consequence, identifying skin cancer at an earlier phase, our research 
proposed YOLO v3 - DCNN architecture. This research can be divided into 
the following stages: 


° Initially, You Only Look Once (YOLO)-v3 architecture uses a 
combination of YOLO v2, Darknet-53, and Residual networks to extract 
features from skin lesions. Simultaneously, color-texture characteristics 
were retrieved by Grey Level Co-occurrence Matrix (GLCM) with 
Redundant Contourlet Transform(RCT) and color moments with 
QuadHistogram. Then extracted features are get fused. 


° In addition, the fused characteristics were fed into the DCNN for 
categorization of skin lesions including malignant, benign, as well as basal 
cell carcinoma (bcc). 


The remainder portions of the paper were assembled as follows: Section 2 
gives a review regarding feature extraction as well as categorization of skin 
cancer. Section 3 explains the proposed YOLO v3 - DCNN methodology. 
Section 4, elaborates on the experimental results, comparisons, and 
discussions, followed by Section 5, concludes the paper. 


LITERATURE SURVEY 


Much study has been undertaken in latest days employing various artificial 
neural networks for early diagnosis of skin malignancies, which is 
addressed below: 


Xie et al [25] using digital dermoscopy images, established a unique 
approach for categorizing melanocytic tumors as benign or malignant. The 
suggested border characteristics discussed here were supposed to be 
insensitive to lesion object incompleteness. The suggested system 
functioned in three steps. Initially, a self-generating NN was being used to 
retrieve lesions from  photographs.The second phase retrieved 
characteristics such as tumor boundary, texture, and color information. 
Feature dimensionality reduction was utilized to remove less relevant or 
noisy features, resulting in improved classification performance. 
Subsequently, lesions were categorized utilizing a NN ensemble system in 


the final phase. Ensemble NN optimizes categorization effectiveness by 
incorporating back propagation (BP) with fuzzy neural networks. 


Yu et al. [26] offer a novel melanoma detection strategy depend on 
extremely deep convolutional neural networks (CNNs). We seamlessly 
merge the two phases and provide an automated framework that does not 
require manual intervention. A fully convolutional residual network (FCRN) 
containing 16 residual blocks is being used in the segmentation step to 
optimize effectiveness. During categorization, the chosen strategy would 
use an average of SVM plus softmax models. That has an 85.5 % accuracy 
for melanoma categorization using segmentation as well as an 82.8 % 
accuracy without segmentation. Among the most serious barriers in 
deploying deep CNNs with medical image processing tasks is indeed a 
scarcity of high-quality training set. 


Dorj et al. [27] established a strategy for categorizing skin lesion 
photographs into four distinct groups. To satisfy the necessity for an 
intelligent as well as rapid skin cancer categorization, a pre-trained deep 
CNN termed AlexNet is being used for extraction of features, accompanied 
by an error-correcting output coding SVM functioning as a classifier. 


Nida et al. [28] suggested an innovative methodology centered on RCNN as 
well as FCM clustering enabling accurate & automated Melanoma area 
segmentation among  dermoscopic pictures. A deep Region- 
dependent convolutional neural network (RCNN) effectively recognizes the 
numerous afflicted areas mostly in form of bounding boxes utilizing Fuzzy 
C-mean (FCM) grouping, that facilitates localization. 


Adegun et al [29] introduced a unique paradigm for automated skin cancer 
detection which includes segmentation as well as categorization of skin 
lesions. A deep learning-dependent CAD architecture comprised of a multi- 
scale encoder-decoder segmentation system as well as an FCN- 
dependent DenseNet categorization network has also been introduced for 
the recognition as well as categorization of skin lesion photos in order to 
recognise skin cancer illnesses. 


Akram et al. [30] suggested an unique methodology for skin lesion 
categorization which utilizes deep feature data to create the far more 
discriminant feature vector whilst preserving the initial feature space. 
Following fine-tuning, they incorporated the obtained data from the chosen 
pre-trained systems, that considerably elevated categorization accuracy. In 
the recommend approach, the researchers utilized less than 3% of the total 
characteristics, that not only enhances categorization accuracy by 
eliminating redundancies but also minimizes computing time. After 
execution of this approach, researchers make the following claims: (a) 
fusion of retrieved characteristics from a collection of pre-trained systems 
promotes overall accuracy; while (b) the inclusion of an attribute selection 
plus dimensionality reduction phase enhances categorization outcomes 
greatly. 


Khan et al [31] used a unique technique that not only identifies skin cancer 
but also classifies it. The suggested technique is basically focused on 
saliency assessment as well as the identification among the most 
discriminating deep characteristics. To enhance lesion distinction, the 
suggested Gaussian system is employed, followed by a colour space transfer 
from RGB to HSV. By keeping that foreground and background as distinct 
as possible, the new color space aids in the building of saliency maps that 
use inner and outer discontinuous windows. The inception CNN 
architecture on two basic output tiers is being used to capture deep 
information from segmented photos. Those retrieved characteristics are 
then fused utilizing suggested decision-controlled parallel fusion approach 
before even being chosen employing the suggested window distance- 
controlled entropy characteristics selection technique. The most 
differentiating characteristics were then put to categorization. 


Khan et al. [32] developed a novel strategy for skin cancer localization as 
well as diagnosis by merging a deep learning system with an iteration- 
controlled Newton-Raphson (IcNR) oriented attribute selection approach. 
This suggested system comprises of three fundamental steps: lesion 
localization utilizing a faster region-dependent convolutional neural network 
(RCNN), deep characteristic retrieval, as well as feature selection that use 
the IcCNR approach. There in localization step, a novel contrasted stretching 
strategy centered on the bee colony method (ABC) has been applied. The 
augmented images, together along with underlying ground truths, were 
then fed into Fast-RCNN, resulting yields segmented pictures. A pre-trained 
model, DenseNet201, has been utilised to retrieve deep features via 
transfer learning, which would then be exposed to a decision stage through 
the suggested IcNR approach. Ultimately, the most discriminating 
characteristics are categorized employing layered feedforward neural 
networks. 


Khan et al. [33] developed a totally automated computer aided diagnostic 
(CAD) solution relying on the deep learning framework. The initial 
dermoscopic images were once pre-processed employing the decorrelation 
formulation strategy, and also resulting images are then supplied to the 
MASK-RCNN for lesion segmentation. This MASK RCNN network is again 
trained utilizing the segmented RGB pictures obtained from the ground 
truth images. Incorporating a GrabCut algorithm as well as an Adaptive 
Neuro-Fuzzy Classifier (ANFC) framework. 


Sikkandar et al [34] propose a novel segmentation-dependent categorization 
framework for skin lesion identification. Initially, a Top hat filter as well as 
inpainting techniques were used in the preprocessing stage. The 
preprocessed pictures are then segmented using the Grabcut method. The 
feature extraction approach is then utilized in collaboration with a deep 
learning-dependent Inception model. Ultimately, the dermoscopic pictures 
were categorized using an adaptive neuro-fuzzy classifier (ANFC) network. 


Saeed et al. [35] proposed an advanced skin cancer detection and 
classification method based on DCNN architectural variations. Transfer 
learning, fine-tuning, the ensemble technique, data creation, and 
augmentation are all viable methods for reducing labeled data insufficiency, 
which is prone to overfitting. It also helps to improve the overall 
effectiveness of skin lesion categorization in CAD systems. 


However, no research has been conducted to increase the accuracy and 
speed of the process by combining artificial intelligence approaches and 
traditional feature extraction methods with YOLO-based methodologies. 


AUTOMATIC DETECTION FOR EARLY DIAGNOSIS OF SKIN CANCER 
AND CLASSIFICATION USING YOLO V3 - DCNN ARCHITECTURE 


Skin cancer is among the most lethal diseases in the globe, therefore early 
identification as well as categorization are necessary. Thus, this paper 
proposed a combined architecture YOLO version 3 and deep CNN has been 
presented for detecting and classifying the skin lesion. It explored a You 
Only Look Once (YOLO)-v3 architecture, which uses a combination of YOLO 
v2, Darknet-53, and Residual networks to extract features from skin lesions. 
The darknet's 53 layers are layered with 53 additional layers for the 
detection head, giving YOLO v3 a total of 106 layers fully convolutional 
underlying architecture. This layer generates the feature map with high 
accuracy. Simultaneously texture and color features are extracted in which 
texture feature extraction can be done employing a Grey Level Co- 
occurrence Matrix (GLCM) with Redundant Contourlet Transform(RCT) to 
provide accurate information about an image's texture, and Color moments 
are being used to extract colour characteristics with QuadHistogram, 
wherein the quadtree decomposition being done to the pictures and 
homogeneous blocks of distinct size are given. Then, the concatenated 
features have been reduced using the PCA method. The YOLO v3 as well as 
PCA features are then combined to make accurate features, which would 
then be fed into a Deep Convolution Neural Network (DCNN) that's been 
trained with the unique ground truths to increase categorization accuracy. 
As a consequence, our proposed technique enhances the identification as 
well as categorization of skin lesions. Fig.,1 depicts the framework of the 
proposed strategy. 


Fig. 1. Structure of proposed approach 
Feature extraction by YOLO-v3 


According to recent research,, risk factors could be considerably minimized 
by detecting it early on, making it almost curable.This prompt identification 
and categorization necessitate the use of an automated system, even though 
the method is rather difficult. Thus our research introduces YOLOv3- DCNN 
architecture to increase the feature extraction as well as categorization 
accuracy. Firstly, the input data was loaded into the YOLO v3 - DCNN 
architecture in which features are extracted through the YOLO v3 
simultaneously the color and texture features are extracted with Grey Level 
Co-occurrence Matrix (GLCM) with Redundant Contourlet Transform and 
color moments with quad histogram. The obtained features then get fused. 


Initially, YOLOv3 uses YOLOv2, Darknet-53, and residual layers, as a novel 
deep neural network architecture. Darknet-53 has 53 convolutional layers, 
the majority of which are 3 x 3 and 1 x 1 filters, as well as the darknet's 53 
layers, are layered with 53 additional layers for the detection head. Instead 
of forecasting the locations for bounding boxes from completely linked 
layers, Darknet leverages anchor boxes. There are 448*448 high-resolution 
classifiers in our input pictures. The size of the anchor boxes in YOLO v2 is 
governed by the input picture. It is decreased by a factor of 32. 


The network creates five bounding boxes containing five coordinates each: 
taw bay, tzw tz and t,,. The width, height, x-coordinate, y-coordinate, as well 
as class label of every annotation in the collection were expressed by these 
coordinates.The predictions 0,,,b,,b,,, and b,, are the predicted bounding 
box coordinates. The cell offsets (Cæ, C,,,) indicate the length of each cell in 
the picture, as well as the size of the bounding boxes (Pw D,,). The 
predictions are determined as follows: 
b =T.) +C,, 


(1) 


b, =0.) +Cy 


(2) 


(3) 


(4) 


Pal Obj) x IOU| b,, obj|= 0%) 
(5) 


Wherep,, seems to be the conditional class probability and IOU is the 
intersection of the predicted as well as ground truth bounding boxes, andg¿ 
) is the change in class label. The input size is reduced by using anchor 
boxes. The network's last layer is composed of 1000 filters, accompanied by 
a final layer with a configurable count of filters. The quantity of filters for 
this layer has been evaluated by the count of classes discovered, as seen 
below: 
N, zfilters 7 5 x ( 5 + N A 


(6) 


For Nasses = 2, the count of filters in the final layer is 35. Whilst YOLO 
approach extracts characteristics and detects objects using area 
information, the combination of feature maps containing popular texture as 
well as colour characteristics was utilized to enhance the algorithm's 
performance. The techniques for extracting texture and color features are 
discussed as follows. 


The GLCM with Redundant Contourlet Transform (RCT) is used to extract 
the image's texture features. The GLCM and RCT are widely used and 
provide accurate information about an image's texture. The texture of a 
picture is described by GLCM via estimating the frequency of pairs of pixels 
in the picture with defined values as well as a defined spatial relationship. 
p,i,,Jj,) was utilized in this approach to obtain the image's pixel values at 
the spatial coordinates i, ċ¿j, Every measurement generates a 416 * 416 
matrix that is utilized to describe each pixel in the picture. 


In our research, the Redundant Contourlet Transform (RCT) generates 
equal-size directional subband pictures. This redundancy gets produced by 
avoiding any downsampling procedure in the Laplacian pyramid approach; 
L lowpass approximations of the picture were constructed by employing L 
suitable low pass filters. When paired with the GLCM, RCT provides high- 
quality data on the whole texture of the picture. The GLCM and RCT 
features have a minimum level of complexity and extract higher accurate 
features. 


The colour characteristics are then reconstructed utilising colour 
consequences utilizing Quad Histogram, in which the images are quadtree 
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decomposed and homogeneous blocks of varied sizes are provided. Aside 
from analysing the global texture properties, an orthogonal combination of 
local binary patterns has also been proposed as a colour predictor for 
dermoscopic skin lesions. 


The color features are extracted color moments with the quad histogram 
are explained as follows: The implementation of a colour moments-based 
colour feature extraction approach to the three colour channels, R, G, as 
well as B, is expanded. In each of the three channels, single as well as multi- 
integrative co-occurrence matrices were also constructed. As a 
consequence, a feature vector of dimensions of 208*208*8 be produced for 
each image. Every feature vector for one single channel approximately 
208*208 in size. There seem to be three channels for a single co-occurrence 
matrix: R, G, as well as B. There have been three extra channels for multi 
co-occurrence matrices: RG, GB, as well as BR. The original image's single 
and multi-co-occurrences are then computed as two distinct channels, 
increasing the dimensions to 8. For proper concatenation, the feature 
vector is resized to 416*416*2. 


Several depths i,of the quad-tree are thus generated, each containing 
range-blocks of the same size. This research presents some feature 
histograms. Let L, denote the greatest depth enforced in the (fractal) 
decomposition with quad-tree refinement, and /, denotes the minimum 
depth. Q,,;,..,thedomain is defined as: 


OQ; K7 li, j)é€LN,|L,si,sL, 1s j,k; \(7| 


Here, i, is the depth in the quad-tree structure and k, is the number of 
feature-bins selected. A function is defined as a histogram h, on®, Lks 
h: Q, 1.x. >R with h,=0 (8) 


If Lied | EQ, ; x, then ha; = hi, jais referred to as the value of h, ati Talh If 
a histogram h, on Q, zx, satisfies the required extra criteria, it is referred to 
as a (weighted) quad-tree feature histogram. 
Borg Weg Vogg 9) 

L; k, 
Where, >) w„ <1; w, 20and v,, ,20;)) v,, ;=1, VIEN, with 1,si,sL, 

i=l j=l 
Equation 9 can be interpreted as follows: It wz; can be interpreted to mean 
weighing the contents of the bins based on depth i,. Also, each depth í, we 
have k, binsV,; ;, the contents of which add up to 1. The color moments with 
quad histogram color features have a minimum level of complexity and 
minimum computation power. Finally, color and texture features are 
obtained through GLCM with RCT and color moments with quad histogram 
methods. 


Then, the texture and color features are merged into a single 416*416*7 
matrix and given as input to the PCA which reduces the dimensionality of 
the concatenated features. The feature map has been generated using the 
PCA output of 7*7*1024. Moreover, the feature map created by YOLO v3 
has 7*7*1024 in size. Both feature maps, including YOLO v3, colour, as well 
as texture features, are then compressed prior to getting concatenated to 
establish a fused feature map. This feature map was fed into the DCNN via 
the feature fusion block, which will then be trained to generate the output. 


Classification based on Deep Convolutional Neural Network (DCNN) 


DCNN is made up of three basic layers: the convolutional layer, the pooling 
layer, as well as the fully connected layer. Each convolution layer in DCNN 
has a large count of weights which are subsampled by the pooling layer to 
give output from the convolution layer whilst lowering the information ratio 
of the layer below. Ultimately, the outputs of the pooling layer were utilized 
to inject into the fully linked layers. Convolutional neuron layers, which also 
include data for a variety of applications such as image categorization as 
well as numerous 2D matrices, are indeed a key component of DCNN. 


This approach may retrieve the regional attributes of the original picture 
depending on the retrieval of local features. This learning procedure's 
principal purpose is to build certain kernel matrices for creating more 
prominent features for the issue of skin cancer identification. In this case, 
the backpropagation (BP) model was adopted to get the lowest feasible 
error value for the system. Sliding window-based convolution has been 
employed for the network. In this research, the rectified linear unit (ReLU) 
would be employed as the activation function for the neurons via the 
function f,|x,)=max|x,,0|.Max pooling can also be used to reduce the 
output network size, so that only the optimum levels were assessed as the 
sliding grid's next layer. 


The BP strategy is indeed a gradient descent-dependent system for 
eliminating neural network error that employs cross-entropy loss as the 
fitness function. This strategy may be described simply as follows. 

N, M, 


L=} U-dlog yz (10) 
Jz & 


Where N, would be the sample number,d,, = a a has 
been the needed output vector, while y,,, seems to be the obtained output 
vector of the m,th class, that may be derived utilizing the preceding 
formula: 


(11) 
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The weight penalty is being used in the creation of the function L, to 
incorporate a value to enhance the weight values: 


MM eG 
Lead 2 —d log Vita nd 2 wp l, 
z dy z L; (12) 


Where, , indicates the weight of the connection, L, signifies the total count 
of layers, whileK, represents the layer l, connections. This DCNN layer 
classifies the different types of skin cancers. Figure 2 illustrates the block 
diagram of DCNN for skin cancer detection. 


Different 
———_ types of 
skin lesions 


Fully connected layers 


Fig. 2. Structure of DCNN in skin cancer detection 


As a consequence, the proposed method accurately detects and classifies 
the different types of skin lesions. Moreover, the next section deals with the 
implementation results of this proposed strategy. 


RESULT AND DISCUSSION 


This section details the implementation findings as well as the performances 
of our proposed methodology. Also, comparison results of the existing works 
are presented. 


Tool : PYTHON 3 

OS : Windows 7 (64 bit) 
Processor : Intel Premium 
RAM : 8GB RAM 


Dataset Description 


The present collection of dermatoscopic pictures is tiny and lacks variety, 
making it challenging to train neural networks for automatic detection of 
pigmented skin lesions. The HAM10000 [36] dataset ("Human Against 
Machine having 10000 training pictures") addresses the aforementioned 
problem. The authors gathered dermatoscopic pictures from a multitude of 
populations, which have been recorded and preserved utilizing a diversity of 
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modalities. Furthermore, the _ resultant dataset contains 10015 
dermatoscopic pictures, which may have been utilized as a training set for 
educational machine learning. The pigmented lesions comprise actinic 
keratoses as well as intraepithelial carcinoma / Bowen's disease (akiec), 
basal cell carcinoma (bcc), benign keratosis-like lesions (solar lentigines / 
seborrheic keratoses and lichen-planus like keratoses, bkl), dermatofibroma 
(df), melanoma (mel), melanoc (angiomas, angiokeratomas, pyogenic 
granulomas and hemorrhage, vasc). Additionally, the lesion id column in the 
HAM10000 metadata file might well be utilized to track lesions with a large 
count of pictures in the dataset. 


Skin lesion detection from the input dataset 


Input Image from the Dataset Detects skin lesion location 


Fig. 3. Skin lesion detection from the input dataset 


Figure 3 illustrates skin lesion detection, by using our novel technique. The 
input image from the MNIST HAM10000 dataset is shown in figure 3(a). 
From the input image (figure 3(a)) the skin lesion is detected using our 
proposed YOLO-v3 approach, as shown in figure 3(b), in which the red color 
box is portrayed as a bounding box. The bounding box indicates the region 
of the lesions for which a high degree of accuracy can be predicted. For the 
categorization process, all retrieved plus conventional characteristics from 
the bounding boxes were fused. 


Skin lesion classification 
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Fig. 4. Skin lesion classification 


The proposed YOLOv3-DCNN being tested for categorization of benign as 
well as malignant skin lesions on the MNIST HAM10000 dataset. The 
characteristics obtained from GLCM, RCT, and CLCM are integrated with 
YOLO-v3 features which are depicted in figure 4(a). Then, the extracted 
features are classified using the proposed DCNN approach, which is shown 
in Figure 4(b). Furthermore, our results indicate that our proposed 
approach can detect skin lesions earlier and with greater accuracy. 


Performance Parameters 


This section describes the performance of our proposed technique whereas; 
various parameters such as accuracy, precision were employed to assess 
the effectiveness of the unique technique to identifying as well as 
categorizing skin lesions. 


Accuracy 


The most basic intuitive performance metric is accuracy, which seems to be 
the ratio of precisely predicted observations to all observations. The 
accuracy is formulated as follows (13) 


TP+TN 
TP+ FP+ FN+ TN 


Accuracy = 
(13) 


Where, TP- True Positive, TN-True Negative, FP- False Positive, FN-False 
Negative 
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Accuracy of Proposed Model 
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Fig. 5. Accuracy of the proposed model 


Figure 5 depicts the accuracy of our proposed skin lesion detection 
approach. When YOLO-V3 characteristics were integrated with GLCM, RCT, 
as well as CLCM features, our proposed architecture's accuracy improves. 
The accuracy of our suggested technique is 95%, which describes the 
effectiveness of the feature fusion approach. 


Precision 


Precision has been described as the ratio of correctly predicted positive 
observations to total predicted positive observations. The precision is 
calculated by equation (14), 
TP 
TP+FP 


Precision = 
(14) 


Precision of skin lesions 
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Fig. 6. Precision of the proposed model 


The precision of the proposed skin lesion categorization approach is shown 
in figure 6. The DCNN technique improves the precision of the proposed 
architecture. The precision of our suggested technique attains the highest 
value of 100% and the lowest value of 91%, which describes the 
effectiveness of the classification approach. 


Training accuracy and validation accuracy of the proposed model 
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Figure 7 depicts the training as well as validation accuracy of our proposed 
approach. In figure 7, green color graphical lines illustrate the training 
accuracy, whereas brown color graphical lines illustrate validation 
accuracy. 


THE ACCURACY OF THE TRAINING AND VALIDATION PHASE OF THE MODEL 
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Fig. 7. The proposed model's training as well as validation accuracy 


The accuracy of training and validation is increasing over time. Our 
proposed model reaches 0.97 training accuracy and 0.975 validation 
accuracy over 9 epochs. Furthermore, the training accuracy exceeds the 
validation accuracy. 


The proposed model's training as well as validation losses 


Figure 8 depicts the training as well as validation losses of our proposed 
model. In figure 8, orange color graphical lines illustrate the training loss, 
whereas violet color graphical lines illustrate validation loss. 


THE LOSS OF THE TRAINING AND VALIDATION PHASE OF THE MODEL 
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Fig. 8. The proposed model's training as well as validation losses 


Training as well as validation loss gradually reducing with time. Figure 8 
shows that the proposed model obtained less than 0.1 train loss in 9 epochs. 
In 9.5 epochs, the proposed model obtained less than 0.1 validation loss. 
Furthermore, the training loss is less severe than the validation loss. 
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Confusion Matrix 


THE MODEL CONFUSION MATRIX WITH TESTING DATA 


w - 2 1 17 0 
ò i 2 5 
Fig. 9. Confusion matrix 


Figure 9 displays the skin lesion recognition confusion matrix. The proposed 
model produces the confusion matrix, in which the diagonal values indicate 
the count of tuples properly identified by the models and the off-diagonal 
values denotes the count of tuples erroneously categorized by the models. 
The better the performance, the higher the diagonal value. 


Comparision Results 


This section explains the proposed technique's comparative outcomes, in 
which our novel technique is compared to the baseline approach, such as 
Grey Level Co-occurrence Matrix (GLCM) with Gabor and Color Level Co- 
occurrence Matrix (CLCM) [37], Grey Level Co-occurrence Matrix (GLCM) 
with Gabor features [37], Color Level Co-occurrence Matrix (CLCM) with 
Gabor features [37], and Gabor with Color Level Co-occurrence Matrix 
(CLCM) [37]. 


Table 1. Overall Accuracy 


Methods Accuracy 
(%) 
GLCM + Gabor + 94 
CLCM [37] 
GLCM + Gabor 90 
[37] 
CLCM + Gabor 91 
[37] 
Gabor + CLCM 88 
[37] 
GLCM+RCT+CL 95 
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Fig. 10. Accuracy Comparison 


Figure 10 illustrates the overall comparison of the feature fusion accuracy. 
The proposed technique attains higher accuracy by incorporating Grey 
Level Co-occurrence Matrix (GLCM) with Redundant Contourlet 
Transform(RCT) and Color moments with QuadHistogram. Our proposed 
approach compared with the baseline GLCM + Gabor + CLCM [37], GLCM 
+ Gabor [37], CLCM + Gabor, and Gabor + CLCM [37] such as 94%, 90%, 
91%, and 88%. As a consequence, our proposed approach has a superior 
accuracy of 95% than existing methodologies. 


Table 2. Overall Precision 


Methods Precision (%) 
GLCM + Gabor + 92 
CLCM [37] 
GLCM + Gabor 87 
[37] 
CLCM + Gabor 86 
Gabor + CLCM 85 
GLCM+RCT+CL 94 
CM (Proposed) 
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Fig. 11. Precision Comparison 


Figure 11 depicts the precision for the overall feature fusion comparison. 
The precision of the proposed technique attains a higher precision by 
incorporating Grey Level Co-occurrence Matrix (GLCM) with Redundant 
Contourlet Transform(RCT) and Color moments with QuadHistogram. Our 
proposed approach compared with the baseline GLCM + Gabor + CLCM 
[37], GLCM + Gabor [37], CLCM + Gabor, and Gabor + CLCM [37] such as 
92%, 87%, 86%, and 85%. As a consequence, our proposed technique has a 
greater accuracy of 94 % than existing techniques. 


Table 3. Model Accuracy 


Methods Accuracy (%) 
MobileNet [37] 80 
SSD-MobileNet 90 

[37] 
Faster R-CNN 92 
[37] 
YOLO-Vv3 - 95 
DCNN 
(Proposed) 
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Fig. 12. Model Comparison 


Figure 12 depicts an overall comparison of model accuracy. The proposed 
model attains higher accuracy by incorporating YOLO-v3 - DCNN. Our 
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proposed approach compared with the baseline MobileNet [37], SSD- 
MobileNet [37], and Faster R-CNN [37] such as 80%, 90%, and 92%. Thus 
our proposed technique has obtained an accuracy of 95% which is higher 
than the existing techniques. 


CONCLUSION 


Early illness diagnosis is critical in the healthcare and medical fields for 
treatment planning. Treatment and consequent mitigation were critical for 
survival if illnesses are recognized early. Also with growth of numerous skin 
disorders as a result of pollution and other causes, the categorization of 
surface-level skin cancers as well as diseases is becoming an important 
subject of study. It is incredibly hard to acquire labelled data that would aid 
categorization models. Furthermore, low picture quality, fuzzy regions, 
including noise such as hair or sweat in dermoscopic pictures may have an 
effect on model training. Moreover, the proposed method obtains color- 
texture characteristics from input pictures, which are then concatenated 
with DCNN features for further FCN training, significantly optimizes the 
classifier's performance. In terms of accuracy and categorization indices, 
the proposed YOLO v3 - DCNN method outperforms previous models. 
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